Monitoring and collecting medical data using embedded medical diagnostic devices with multiple sensors and sending these actual measured data to the corresponding health monitoring centers using multipurpose wireless networks to take necessary measures to coordinate with family medical service centers and regional medical service departments is a popular medical big data architecture. However, healthcare big data is characterized by large data volume, fast growth, multimodality, high value and privacy, etc. How to organize and manage it in a unified and efficient way is an important research direction at present. In response to the problems of low balance and poor security in the storage of data collected by distributed sensor networks in healthcare systems, we propose a distributed storage algorithm for big data in healthcare systems. The platform adopts Hadoop distributed file system and distributed file storage framework as the healthcare big data storage solution, and implements data integration, multidimensional data query and analysis mining components based on Spark-SQL data query tool, Spark machine learning algorithm library and its mining and analysis pipeline development, respectively. The distributed storage model of big data and three data storage levels are constructed using cloud storage architecture, and the data storage intensity as well as levels are calculated by high data access in the upper level, data connection in the middle level, and data archiving in the lower level according to the set known data granularity, odds, and elasticity to realize big data storage. It is experimentally verified that the above algorithm has high distribution balance and low load balance in the storage process.
Loading....